Investigations into early and late reflections on distant-talking speech recognition toward suitable reverberation criteria
نویسندگان
چکیده
Reverberation-robust speech recognition has become very important in the recognition of distant-talking speech. However, as no common reverberation criteria for the recognition of reverberantspeech have been proposed, it has been difficult to estimate this. We have thus focused on a reverberation criterion for the recognition of distant-talking speech. The reverberation time is generally currently used as a reverberation criterion for the recognition of distant-talking speech. This is unique and does not depend on the position of the source in a room. However, distant-talking speech recognition greatly depends on the location of the talker relative to that of the microphone and the distance between them. We investigated a suitable reverberation criterion with the ISO3382 acoustic parameters for distant-talking speech recognition to overcome this problem. We first calculated distant-talking speech recognition with early and late reflections based on the impulse response between the talker and microphone. As a result, we found that early reflections within about 12.5 ms from the duration of direct sound contributed slightly to distant-talking speech recognition in non-noisy environments. We then evaluated it based on ISO3382 acoustic parameters. We consequently confirmed that the ISO3382 acoustic parameters are strong candidates for the new reverberation criteria for distant-talking speech recognition.
منابع مشابه
Performance estimation of reverberant speech recognition based on reverberant criteria RSR-dn with acoustic parameters
Reverberation-robust speech recognition has become very important in the field of distant-talking speech recognition. However, as no common reverberation criteria for the recognition of reverberant speech have yet been proposed, it has been difficult to estimate its effectiveness. To address this problem in 2007, we investigated early and late reflections on distanttalking speech recognition to...
متن کاملDistant-Talking Speech Recognition Based on Spectral Subtraction by Multi-Channel LMS Algorithm
We propose a blind dereverberation method based on spectral subtraction using a multi-channel least mean squares (MCLMS) algorithm for distant-talking speech recognition. In a distant-talking environment, the channel impulse response is longer than the short-term spectral analysis window. By treating the late reverberation as additive noise, a noise reduction technique based on spectral subtrac...
متن کاملDeep neural network-based bottleneck feature and denoising autoencoder-based dereverberation for distant-talking speaker identification
Deep neural network (DNN)-based approaches have been shown to be effective in many automatic speech recognition systems. However, few works have focused on DNNs for distant-talking speaker recognition. In this study, a bottleneck feature derived from a DNN and a cepstral domain denoising autoencoder (DAE)-based dereverberation are presented for distant-talking speaker identification, and a comb...
متن کاملTemporal-spatial processing of a single speech reflection in normal-hearing and hearing-impaired listeners
Reflections and reverberation considerably affect speech recognition in rooms. Early reflections of the speech signal have mostly been found to enhance speech intelligibility [1, 2, 3, 4], while late reflections can have detrimental effects [4, 5]. A better understanding of the mechanisms underlying speech perception in noisy and reverberant conditions can be important for various practical inv...
متن کاملA Combined Approach for Estimating a Feature-domain Reverberation Model in Non-diffuse Environments
A combined approach for estimating a feature-domain reverberation model suitable for the robust distant-talking automatic speech recognition concept REMOS (REverberation MOdeling for Speech recognition) [1] is proposed. Based on a few calibration utterances recorded in the target environment, the combined approach employs ML estimation and blind estimation of the reverberation time to determine...
متن کامل